The notion of weak measurement provides a formalism for extracting information from a quantum
system in the limit of vanishing disturbance to its state. Here we extend this formalism to the
measurement of sequences of observables. When these observables do not commute, we may obtain
information about joint properties of a quantum system that would be forbidden in the usual strong
measurement scenario. As an application, we provide a physically compelling characterisation of
the notion of counterfactual quantum computation.

I. INTRODUCTION



Quantum mechanics is still capable of giving us surprises. A good example is the concept of weak measurement 
discovered by Aharonov and his group [1], which challenges one of the canonical dicta of quantum mechanics: that
non-commuting observables cannot be simultaneously measured. 

Standard measurements yield the eigenvalues of the measured observables, but at the same time they significantly
disturb the measured system. In an ideal von Neumann measurement the state of the system after the measurement
becomes an eigenstate of the measured observable, no matter what the original state of the system was. On the
other hand, by coupling a measuring device to a system weakly it is possible to read out certain information while
limiting the disturbance to the system. The situation becomes particularly interesting when one post-selects on a
particular outcome of the experiment. In this case the eigenvalues of the measured observable are no longer the
relevant quantities; rather the measuring device consistently indicates the weak value given by the AAV formula

$$A_w = \frac{\langle\psi_f|A|\psi_i\rangle}{\langle\psi_f|\psi_i\rangle}, \tag{1}$$

where $A$ is the operator whose value is being ascertained, $|\psi_i\rangle$ is the initial state of the system, and $|\psi_f\rangle$ is the state
that is post-selected (e.g. by performing a measurement). The significance of this formula is that, if we couple a
measuring device whose pointer has position coordinate $q$ to the system $S$, and subsequently measure $q$, then the
mean value $\langle q\rangle$ of the pointer position is given by

$$\langle q\rangle = g\,\mathrm{Re}[A_w], \tag{2}$$

where Re denotes the real part. This formula requires the initial pointer wavefunction to be real and of zero mean,
but these assumptions will be relaxed later. The coupling interaction is also taken to be the standard von Neumann
measurement interaction $H = gAp$. The coupling constant $g$ is assumed to be small, but we can determine $A_w$ to any
desired accuracy if enough repeats of the experiment are carried out.

Formula (1) implies that, if the initial state is an eigenstate of a measurement operator $A$, then the weak
value post-conditioned on that eigenstate is the same as the classical (strong) measurement result. When there is
a definite outcome, therefore, strong and weak measurements agree. However, weak measurement can yield values
outside the normal range of measurement results, e.g. spins of 100 [1]. It can also give complex values, whose imaginary
part corresponds to the pointer momentum. In fact, the mean of the pointer momentum is given by

$$\langle p\rangle = 2gv\,\mathrm{Im}[A_w], \tag{3}$$

where Im denotes the imaginary part and v is the variance in the initial pointer momentum. 
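
As a minimal numerical sketch of formula (1) and the pointer mean (2), the following Python fragment evaluates a weak value for a two-level system. The observable (the Pauli matrix sigma_z), the pre- and post-selected states and the value of g are illustrative assumptions, not taken from the text. The angle is chosen so that the overlap of the two states is 0.01, which pushes the weak value to about 100 even though the eigenvalues are +1 and -1.

```python
import math

def weak_value(A, psi_i, psi_f):
    """Weak value <psi_f|A|psi_i> / <psi_f|psi_i> for a 2x2 matrix A."""
    A_psi = [sum(A[r][c] * psi_i[c] for c in range(2)) for r in range(2)]
    num = sum(complex(psi_f[r]).conjugate() * A_psi[r] for r in range(2))
    den = sum(complex(psi_f[r]).conjugate() * psi_i[r] for r in range(2))
    return num / den

sigma_z = [[1, 0], [0, -1]]          # illustrative observable, eigenvalues +1 and -1

# Nearly orthogonal pre- and post-selection: <psi_f|psi_i> = cos(2*theta) = 0.01.
theta = 0.5 * math.acos(0.01)
psi_i = [math.cos(theta), math.sin(theta)]
psi_f = [math.cos(theta), -math.sin(theta)]

a_w = weak_value(sigma_z, psi_i, psi_f)   # lies far outside the eigenvalue range
g = 1e-3                                  # assumed small coupling constant
mean_q = g * a_w.real                     # pointer mean predicted by formula (2)

# If the initial state is an eigenstate of A, the weak value is the eigenvalue.
a_w_eigen = weak_value(sigma_z, [1, 0], psi_f)
```

The small overlap in the denominator of (1) is what produces the anomalous value; it also means that such post-selections succeed rarely, so many runs are needed.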

The fact that one hardly disturbs the system in making weak measurements means that one can in principle measure 
different variables in succession. We follow this idea up in this paper. 



II. A NEW PARADOX 



Weak measurement has proved to be a valuable tool in analysing paradoxical quantum situations, such as Hardy's
paradox [2, 3]. To illustrate the idea of sequential weak measurement and its potential applications we first construct
a new quantum paradox. Consider the double interferometer, the optical circuit shown in Figure 1, where a photon
passes through two successive interferometers. This configuration has been considered previously by Blasi and Hardy
in another context. Using the labels of the paths shown in the figure, and denoting the action of the $i$-th beam-splitter
by $U_i$, the system evolves as follows:



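$$U_1|A\rangle = (|B\rangle + |C\rangle)/\sqrt{2}, \tag{4}$$

$$U_2|B\rangle = (|E\rangle + |F\rangle)/\sqrt{2}, \qquad U_2|C\rangle = (|E\rangle - |F\rangle)/\sqrt{2}, \tag{5}$$

$$U_3|E\rangle = (-|D\rangle + |D'\rangle)/\sqrt{2}, \qquad U_3|F\rangle = (|D\rangle + |D'\rangle)/\sqrt{2}. \tag{6}$$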



(The signs here are determined by the fact that reflection on the silvered outer surface of a beam-splitter gives a phase
of $\pi$ whereas transmission or reflection by the inner surface gives zero phase.)

FIG. 1: The double interferometer: an optical circuit in which a photon, injected along path A, passes through two
interferometers, represented by paths B and C and paths E and F. Finally, the photon is post-selected at the detector D. The
beam-splitters are shown with their reflecting surface marked in black.



Suppose now that we select a large number N of successful runs of our experiment, i.e. those runs where the photon 
is detected by the detector D. 

We can now make the following statements about this situation: 

(1) All photons go through path E. 

Indeed, equations (4) and (5) tell us that if a photon is injected along path A, it must exit the first interferometer
along path E. Consequently, if we measure the observable $P_E$, the projector for path E, we find the total number of
photons detected is $N_E = N$ with certainty.

(2) All photons go through path C.

Indeed, the second interferometer is arranged in such a way that any photon entering along path B will end up at
D'. Hence, a very simple calculation shows that if, instead of measuring $N_E$, we measure $N_C$, the number of photons
going along path C in all $N$ runs of the experiment, we will obtain with certainty $N_C = N$.

(3) When photons go through path C, a subsequent measurement reveals that half of them must go through path E
and half through path F.

Indeed, if we measure the position of the photons in the first interferometer and find that all go via C, then a
subsequent measurement of $N_E$ and $N_F$ must yield $N/2$ in each case, up to statistical fluctuations. (In fact this is
true regardless of whether or not all photons end up eventually at D.)

(4) When photons go through path E, a subsequent measurement reveals that half of them must have come via path
B and half via path C.

This last statement is similar to point (3) above. 
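
The four statements can be checked with a short strong-measurement calculation. The sketch below is illustrative only: it assumes the beam-splitter relations in the form $U_2|C\rangle = (|E\rangle - |F\rangle)/\sqrt{2}$ and $U_3|E\rangle = (-|D\rangle + |D'\rangle)/\sqrt{2}$, which is the sign choice consistent with statements (1) and (2), and it models statement (4) as a path measurement in the first interferometer followed by one in the second. The helper names `apply` and `keep` are hypothetical.

```python
import math

s = 1 / math.sqrt(2)
# Beam-splitter amplitudes, stored as U[input_path][output_path] (assumed signs).
U1 = {"A": {"B": s, "C": s}}
U2 = {"B": {"E": s, "F": s}, "C": {"E": s, "F": -s}}
U3 = {"E": {"D": -s, "D'": s}, "F": {"D": s, "D'": s}}

def apply(U, state):
    """Propagate a dict {path: amplitude} through one beam-splitter."""
    out = {}
    for path, amp in state.items():
        for new_path, u in U[path].items():
            out[new_path] = out.get(new_path, 0.0) + amp * u
    return out

def keep(state, path):
    """Strong (projective) measurement with outcome `path`, left unnormalised."""
    return {path: state.get(path, 0.0)}

after_1 = apply(U1, {"A": 1.0})

# (1) With no measurement in the first interferometer, every photon is on E.
p_E = abs(apply(U2, after_1)["E"]) ** 2

# (2) Among photons reaching D, a strong measurement of B versus C finds C.
amp_D_via = {x: apply(U3, apply(U2, keep(after_1, x)))["D"] for x in ("B", "C")}
p_C_given_D = abs(amp_D_via["C"]) ** 2 / sum(abs(a) ** 2 for a in amp_D_via.values())

# (3) A photon found on C is then found on E or F with equal probability.
after_C = apply(U2, {"C": 1.0})
p_E_given_C = abs(after_C["E"]) ** 2
p_F_given_C = abs(after_C["F"]) ** 2

# (4) With strong measurements in both interferometers, photons found on E
#     came via B and via C equally often.
p_then_E = {x: abs(apply(U2, keep(after_1, x))["E"]) ** 2 for x in ("B", "C")}
frac_B_given_E = p_then_E["B"] / (p_then_E["B"] + p_then_E["C"])
```

Each quantity comes from a different measurement arrangement, which is why the four statements cannot be combined naively.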

The above four statements seem to imply a paradoxical situation. On the one hand, statement (2) tells us, when
we pool all the results, that all $N$ photons go via path C; together with statement (3) this implies that the number of
photons that go along path E must be $N/2$. On the other hand, statement (1) tells us that all $N$ photons actually go
along path E! A similar contradiction arises in connection with the number of photons going along path C. On the
one hand, statement (1) tells us that all photons go via E; together with statement (4) this implies that the number
of photons that go along path C must be only $N/2$. On the other hand, statement (2) tells us that all $N$ photons
actually go along path C!

The usual way of resolving this paradox is to say that the above statements refer to measurements that cannot
all be made simultaneously. Indeed, it is true that if we measure $P_E$ we find it is 1 with certainty, but only if we
do not also measure $P_C$. If we also measure $P_C$ in the same experiment, then it is no longer the case that $P_E = 1$.
Similarly, it is true that $P_C = 1$ with certainty, but only if we do not also measure $P_E$. If we also measure $P_E$ in the
same experiment, then it is no longer the case that $P_C = 1$. So, we are told, the statements (1)-(4) above have no
simultaneous meaning, for they do not refer to the same experiment. Hence there is no paradox: in formulating the
paradox presented above we made use of facts that are not all simultaneously true.

FIG. 2: Paths through the double interferometer, and the number of photons that follow the indicated path. Thus for instance
$N_{BE} = N/2$. Note however the curious prediction $N_{BF} = -N/2$.

On the other hand, as is emphasised in [3], one should not dismiss such paradoxes too lightly. Indeed it is possible
to make a trade-off: by accepting some imprecision in measuring $P_E$, $P_C$, etc., we can limit the disturbance these
measurements produce. The way to do this is to weaken the coupling of the measuring devices to the photons.

Since the disturbance is now small, we can make all the measurements in the same experiment, and we expect all
the statements (1)-(4) to be true. Hence we expect $N_E = N$, $N_C = N$ and obviously $N_F = 0$ and $N_B = 0$. On the
other hand, we also expect that $N_{CE}$ and $N_{CF}$, the total numbers of photons that went along C and subsequently
along E or F, respectively, should both be equal to $N/2$; this is because all the $N$ photons go via C and half of
them should continue along E and half along F. Similarly we expect that $N_{CE}$ and $N_{BE}$ should both be $N/2$, since all $N$
photons go along E and half of them must come via B and half via C.

While all the above predictions seem reasonable, here is the surprise: overall we have only $N$ photons. They could
have moved along four possible trajectories: BE, BF, CE or CF. Since $N_{BE} + N_{BF} + N_{CE} + N_{CF} = N$ and since
$N_{BE} = N_{CE} = N_{CF} = N/2$ it must be the case that $N_{BF} = -N/2$! Furthermore, our prediction has a remarkable
internal consistency. We know that the total number of photons that go along F must be zero. They can arrive at
F in two ways, either by BF or CF. Thus $N_F = N_{BF} + N_{CF}$. As noted above, $N_{CF} = N/2$, but no photons are
supposed to go through F. This is due to the fact that $N_{BF}$ is negative, i.e. $N_{BF} = -N/2$.

The above predictions seem totally puzzling, no less puzzling than the original paradox. However, what we have 
now is not a mere interpretation that can simply be dismissed. These are now predictions about the results of real 
measurements - in particular the weak measurement of the number of photons that passes along path B and then 
along path F. This is a two-time measurement. 

In general, by ensuring that the measurement interaction is weak, we can consider sequences of measurements.
Describing such measurements is the main subject of our paper. In the process, we will formally derive the strange
predictions made above for the double interferometer, and will discuss the interpretation of weak measurements.
Finally, we apply these ideas to counterfactual computation, which is a catch-all for numerous counterfactual phenomena
including, for example, interaction-free measurement.

III. SEQUENTIAL WEAK MEASUREMENTS 

The situation we shall consider is where a system $S$ evolves unitarily from an initial state $|\psi_i\rangle$ to a final
post-selected measurement outcome $\langle\psi_f|$. At various points, observables may be measured weakly. Here we consider the
scenario where there is a single copy of the system, with the measuring device weakly coupled to it. Generally, reliable
information will only be obtained after many repeats of the given experiment.

In the simplest case where there is just one observable, $A$ say, we assume the evolution from $|\psi_i\rangle$ to the point where
$A$ is measured is given by $U$, and from this point to the post-selection the evolution is given by $V$. Then we can
rewrite (1) as:

$$A_w = \frac{\langle\psi_f|VAU|\psi_i\rangle}{\langle\psi_f|VU|\psi_i\rangle}, \tag{7}$$

and the mean of the pointer is given by (2) as before.

Consider next the case of two observables, $A_1$ and $A_2$, measured at different times on a system $S$. We assume
the system evolves under $U$ from $|\psi_i\rangle$ to the point where $A_1$ is measured, then under $V$ to the point where $A_2$ is
measured, and finally under $W$ to the post-selection on $\langle\psi_f|$. Our strategy is to use two measuring devices for
measuring $A_1$ and $A_2$. Let the positions of their pointers be denoted by $q_1$ and $q_2$, respectively. We couple them to
the system at successive times, measure $q_1$ and $q_2$, and then take the product $q_1q_2$.

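We begin, therefore, with the weak coupling of system and pointers, with the usual von Neumann-type Hamiltonians
for measuring $A_1$ and $A_2$. The state of system and pointers after this coupling is:

$$\Psi_{SM_1M_2} = e^{-igp_2A_2}\,V\,e^{-igp_1A_1}\,U|\psi_i\rangle_S\,\phi(q_1)\phi(q_2), \tag{8}$$

where $p_1$ and $p_2$ are the two pointer momenta (the label $S$ refers to the system and $M_1$, $M_2$ to the pointers). Here
$\phi(q)$ is the initial pointer distribution, and we have assumed, for simplicity, that the two pointers have identical initial
distributions and equal coupling constants $g$. Post-selecting on $\langle\psi_f|$ gives the state of the pointers as

$$\Psi_{M_1M_2} = \langle\psi_f|W e^{-igp_2A_2}\,V\,e^{-igp_1A_1}\,U|\psi_i\rangle\,\phi(q_1)\phi(q_2). \tag{9}$$

As $g$ is small, we can approximate the state as:

$$\Psi_{M_1M_2} = \langle\psi_f|\Big[W\big(1 - igp_2A_2 - \tfrac{g^2}{2}p_2^2A_2^2 + \ldots\big)V\big(1 - igp_1A_1 - \tfrac{g^2}{2}p_1^2A_1^2 + \ldots\big)U\Big]|\psi_i\rangle\,\phi(q_1)\phi(q_2). \tag{10}$$

Putting $p = -i\,\partial/\partial q$, we get

$$\Psi_{M_1M_2} = F\Big[\phi(q_1)\phi(q_2) - g(A_1)_w\phi'(q_1)\phi(q_2) - g(A_2)_w\phi(q_1)\phi'(q_2) + \tfrac{g^2}{2}(A_1^2)_w\phi''(q_1)\phi(q_2)$$
$$\qquad\qquad + \tfrac{g^2}{2}(A_2^2)_w\phi(q_1)\phi''(q_2) + g^2(A_2,A_1)_w\phi'(q_1)\phi'(q_2) + O(g^3)\Big], \tag{11}$$

where $F = \langle\psi_f|WVU|\psi_i\rangle$, $(A_1)_w = \langle\psi_f|WVA_1U|\psi_i\rangle/F$, $(A_1^2)_w = \langle\psi_f|WVA_1^2U|\psi_i\rangle/F$, $(A_2)_w = \langle\psi_f|WA_2VU|\psi_i\rangle/F$,
$(A_2^2)_w = \langle\psi_f|WA_2^2VU|\psi_i\rangle/F$ and $(A_2,A_1)_w$ is defined by

$$(A_2,A_1)_w = \frac{\langle\psi_f|WA_2VA_1U|\psi_i\rangle}{\langle\psi_f|WVU|\psi_i\rangle}. \tag{12}$$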

Following measurement of $q_1$ and $q_2$, the expected value of their product is given by

$$\langle q_1q_2\rangle = \frac{\int q_1q_2\,|\Psi_{M_1M_2}|^2\,dq_1dq_2}{\int |\Psi_{M_1M_2}|^2\,dq_1dq_2}. \tag{13}$$

For simplicity, let us make the following assumption (we will discuss the general case later):
Assumption A: The initial pointer distribution $\phi$ is real-valued, and its mean is zero, i.e. $\int q\,\phi^2(q)\,dq = 0$.
We also assume, without loss of generality, that $\phi$ is normalised so that $\int \phi^2\,dq = 1$. With these assumptions,
all the terms in (13) of order 0 and 1 in $g$ vanish, and we are left with



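$$\langle q_1q_2\rangle \approx g^2\Big[(A_2,A_1)_w + \overline{(A_2,A_1)_w} + (A_1)_w\overline{(A_2)_w} + \overline{(A_1)_w}(A_2)_w\Big]\left(\int q\,\phi(q)\phi'(q)\,dq\right)^2, \tag{14}$$

where bars denote complex conjugates. Integration by parts implies $\int q\,\phi(q)\phi'(q)\,dq = -\tfrac{1}{2}$, so we get the final result

$$\langle q_1q_2\rangle = \frac{g^2}{2}\,\mathrm{Re}\Big[(A_2,A_1)_w + (A_1)_w\overline{(A_2)_w}\Big]. \tag{15}$$

Here $(A_2,A_1)_w$ is the sequential weak value given by (12): note the reverse order of operators, to fit with the convention
of operating on the left.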
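
The weak-coupling result can be compared with an exact calculation in a simple special case. The sketch below is an illustration under stated assumptions, not part of the derivation: the pointers are taken to be Gaussian (so that $\phi^2$ is a zero-mean Gaussian of standard deviation `sigma`), and the post-selected pointer state is written as a sum over pairs of eigenvalues $(a_1, a_2)$ of shifted pointer wavefunctions weighted by amplitudes `c[a1, a2]`. The function names and the example amplitudes (the four paths of the double interferometer with $A_1 = P_B$, $A_2 = P_F$ and an assumed sign convention for the beam-splitters) are hypothetical choices.

```python
import math

def mean_q1q2_exact(c, g, sigma):
    """Exact <q1 q2> for the unnormalised pointer state
    sum over (a1, a2) of c[a1, a2] * phi(q1 - g*a1) * phi(q2 - g*a2)."""
    def overlap(a, b):        # integral of phi(q - g*a) * phi(q - g*b) dq
        return math.exp(-((g * (a - b)) ** 2) / (8 * sigma ** 2))
    def moment(a, b):         # integral of q * phi(q - g*a) * phi(q - g*b) dq
        return 0.5 * g * (a + b) * overlap(a, b)
    num = den = 0j
    for (a1, a2), ca in c.items():
        for (b1, b2), cb in c.items():
            w = ca * complex(cb).conjugate()
            num += w * moment(a1, b1) * moment(a2, b2)
            den += w * overlap(a1, b1) * overlap(a2, b2)
    return (num / den).real

def mean_q1q2_weak(c, g):
    """Weak-coupling prediction (g^2/2) Re[(A2,A1)_w + (A1)_w conj((A2)_w)]."""
    total = sum(c.values())
    a1_w = sum(a1 * amp for (a1, a2), amp in c.items()) / total
    a2_w = sum(a2 * amp for (a1, a2), amp in c.items()) / total
    a21_w = sum(a1 * a2 * amp for (a1, a2), amp in c.items()) / total
    return 0.5 * g ** 2 * (a21_w + a1_w * complex(a2_w).conjugate()).real

# Example: a1 = 1 means "on path B", a2 = 1 means "on path F"; the values are
# the assumed path amplitudes for CE, CF, BE and BF with post-selection at D.
s = 1 / math.sqrt(2)
c = {(0, 0): -s ** 3, (0, 1): -s ** 3, (1, 0): -s ** 3, (1, 1): s ** 3}

g, sigma = 0.01, 1.0
exact = mean_q1q2_exact(c, g, sigma)
weak = mean_q1q2_weak(c, g)       # equals -g**2 / 4 for these amplitudes
```

For this choice the two numbers agree up to corrections of relative order $g^2/\sigma^2$, and both are negative even though each pointer on its own shows no mean deflection.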



IV. THE SEQUENTIAL WEAK VALUE 

In the section above we considered two measurements (a measurement of $A_1$ at time $t_1$ and of $A_2$ at $t_2$) and we
looked at the product of the outcomes $q_1q_2$ in the limit when the coupling of the measuring devices with the measured
system was weak. This procedure was motivated by our example of the double interferometer: we wanted to check
whether the photon followed a given path, say the path that goes along C in the first interferometer and then along
E in the second interferometer. In that case the variables of interest are $P_C$, the projector on path C, and $P_E$, the
projector on path E. When the photon follows this path, the value of the product of these projectors is 1 while in all
other situations the product is 0. We wanted to see what the behavior of the photon was when the measurements did
not disturb it significantly.

Since $q_1$ measures $A_1$ and $q_2$ measures $A_2$, it seems obvious that the quantity that represents the product of the
two observables is $\langle q_1q_2\rangle$ given in (14) above. However, the situation is more subtle, as we show below.

Consider the simpler case of two commuting operators $A_1$ and $A_2$, and suppose we are interested in the value of the
product $A_2A_1$ at some time $t$. (Note that we are now talking about operators at one given time, not at two different
times.) We can measure this product in two different ways. First, we can measure the product directly, by coupling a
measuring device directly to the product via the interaction Hamiltonian $H = gpA_2A_1$. When we make the coupling
weaker, we find that the pointer indicates the value

$$\langle q\rangle = g\,\mathrm{Re}\,(A_1A_2)_w = g\,\mathrm{Re}\,\frac{\langle\psi_f|A_1A_2|\psi_i\rangle}{\langle\psi_f|\psi_i\rangle}. \tag{16}$$

This is straightforward: it is simply the weak value of the operator $A_2A_1$. On the other hand, we could attempt to
measure the product in the same way that we measured the sequential product. That is, we can use two measuring
devices with pointer position variables $q_1$ and $q_2$, couple the first measuring device to $A_1$ and the second to $A_2$, and
then look at the product $q_1q_2$. The latter method was proposed by Resch and Steinberg [1] for the simultaneous
measurement of two operators. They showed that in this case

$$\langle q_1q_2\rangle = \frac{g^2}{2}\,\mathrm{Re}\Big[(A_1A_2)_w + (A_1)_w\overline{(A_2)_w}\Big]. \tag{17}$$

We see that the value indicated by $\langle q_1q_2\rangle$ is not equal to the weak value of the product, but contains a supplementary
term, $\mathrm{Re}[(A_1)_w\overline{(A_2)_w}]$. In other words, although we expected the two methods to be equivalent, it is not the case. To
obtain the true weak value of the product we must subtract this second term. This second term is an artifact of the
method of using two separate measuring devices rather than coupling one measuring device directly to the product
operator.

In the case of sequential measurement there is no product operator to start with, for we are interested in the
product of the values of operators at two different times. Hence the first method, of coupling directly to the product
operator, makes no sense, and we must use two independent couplings. In order to obtain the quantity of interest, i.e.
the quantity that is relevant to situations such as the double interferometer of Section II, we must subtract the term
$\mathrm{Re}[(A_1)_w\overline{(A_2)_w}]$ from (15). We thus conclude that the quantity of interest is the sequential weak value given in (12).

V. GENERAL SEQUENTIAL WEAK MEASUREMENT



Sequential weak measurement can be easily extended to $n$ measurements of Hermitian operators $A_i$ with intervening
unitary evolution steps $U_i$. The weak values are given by

$$(A_n,\ldots,A_1)_w = \frac{\langle\psi_f|U_{n+1}A_nU_n\cdots A_1U_1|\psi_i\rangle}{\langle\psi_f|U_{n+1}U_n\cdots U_1|\psi_i\rangle}, \tag{18}$$

and the expected values $\langle q_1q_2\cdots q_n\rangle$ can be expressed in terms of these weak values. For example, with Assumption A

$$\langle q_1q_2q_3\rangle = \frac{g^3}{4}\,\mathrm{Re}\Big[(A_3,A_2,A_1)_w + (A_2,A_1)_w\overline{(A_3)_w} + (A_3,A_1)_w\overline{(A_2)_w} + (A_3,A_2)_w\overline{(A_1)_w}\Big], \tag{19}$$

and the case of general $n$ is given in the Appendix. Similarly, we can express expected values for products of momenta
in terms of the weak values (see Appendix). For instance

$$\langle p_1p_2\rangle = 2(gv)^2\,\mathrm{Re}\Big[-(A_2,A_1)_w + (A_1)_w\overline{(A_2)_w}\Big]. \tag{20}$$

Mixed products of positions and momenta give similar formulae. For instance

$$\langle q_1p_2\rangle = g^2v\,\mathrm{Im}\Big[(A_2,A_1)_w + \overline{(A_1)_w}(A_2)_w\Big]. \tag{21}$$

The foregoing examples illustrate a general pattern, which is that expectations of products of $p$'s and $q$'s depend
on the real part of sequential weak values if there is an even number of $p$'s in the product and on the imaginary part
if there is an odd number of $p$'s.
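
One quick consistency check on the prefactors and signs in such formulae is the factorising case. If the measurements act on independent subsystems, each with its own pre- and post-selection, every sequential weak value is a product of single weak values, and the pointer correlations must reduce to products of the single-pointer means $g\,\mathrm{Re}[A_w]$ and $2gv\,\mathrm{Im}[A_w]$. The sketch below assumes the forms $\langle q_1q_2\rangle = (g^2/2)\mathrm{Re}[\ldots]$, $\langle q_1q_2q_3\rangle = (g^3/4)\mathrm{Re}[\ldots]$, $\langle p_1p_2\rangle = 2(gv)^2\mathrm{Re}[\ldots]$ and $\langle q_1p_2\rangle = g^2v\,\mathrm{Im}[\ldots]$; the numerical weak values are arbitrary illustrative numbers.

```python
import math

def q1q2(a21, a1, a2, g):
    return 0.5 * g ** 2 * (a21 + a1 * a2.conjugate()).real

def q1q2q3(a321, a21, a31, a32, a1, a2, a3, g):
    inner = (a321 + a21 * a3.conjugate()
             + a31 * a2.conjugate() + a32 * a1.conjugate())
    return 0.25 * g ** 3 * inner.real

def p1p2(a21, a1, a2, g, v):
    return 2 * (g * v) ** 2 * (-a21 + a1 * a2.conjugate()).real

def q1p2(a21, a1, a2, g, v):
    return g ** 2 * v * (a21 + a1.conjugate() * a2).imag

def mean_q(a, g):            # single-pointer position mean, g Re[A_w]
    return g * a.real

def mean_p(a, g, v):         # single-pointer momentum mean, 2 g v Im[A_w]
    return 2 * g * v * a.imag

# Arbitrary complex single weak values; sequential ones are taken as products.
a1, a2, a3 = 0.3 + 0.7j, -1.2 + 0.4j, 2.0 - 0.5j
g, v = 0.01, 0.8

checks = [
    (q1q2(a2 * a1, a1, a2, g), mean_q(a1, g) * mean_q(a2, g)),
    (q1q2q3(a3 * a2 * a1, a2 * a1, a3 * a1, a3 * a2, a1, a2, a3, g),
     mean_q(a1, g) * mean_q(a2, g) * mean_q(a3, g)),
    (p1p2(a2 * a1, a1, a2, g, v), mean_p(a1, g, v) * mean_p(a2, g, v)),
    (q1p2(a2 * a1, a1, a2, g, v), mean_q(a1, g) * mean_p(a2, g, v)),
]
all_factorise = all(math.isclose(lhs, rhs, rel_tol=1e-9) for lhs, rhs in checks)
```

A wrong prefactor, a misplaced complex conjugate or a flipped sign in any of the four expressions would make the corresponding check fail.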

The sequential weak values satisfy the following rules:

1) Linearity in each variable separately:

$$(A_n,\ldots,A_i,\ldots,A_1)_w + (A_n,\ldots,A_i',\ldots,A_1)_w = (A_n,\ldots,(A_i + A_i'),\ldots,A_1)_w,$$

for any $1 \le i \le n$.

2) Agreement with strong measurement:

Suppose that, with preselection by $|\psi_i\rangle$ and post-selection by $|\psi_f\rangle$, strong measurements of $A_1, A_2, \ldots, A_n$ always
give the same outcomes $a_1, a_2, \ldots, a_n$; then $(A_n,\ldots,A_1)_w = a_1a_2\cdots a_n$.

3) Marginals: If $I$ is the identity operator at location $i$:

$$(A_n,\ldots,A_{i+1},A_{i-1},\ldots,A_1)_w = (A_n,\ldots,A_{i+1},I,A_{i-1},\ldots,A_1)_w.$$



We can illustrate some of these rules with the double interferometer experiment (Figure 1). The measurements we
consider are projectors that detect the presence of a photon on various edges; for instance, the projector $P_B$ indicates
whether a photon is present on the edge B. For simplicity we write $B_w$ for the weak value $(P_B)_w$, etc., and we use
the same convention for sequential weak values. Then using (7) we find $C_w = 1$, $B_w = 0$, $E_w = 1$ and $F_w = 0$. Using
(12) we find $(E,B)_w = 1/2$, $(F,B)_w = -1/2$, $(E,C)_w = 1/2$ and $(F,C)_w = 1/2$. Since $P_E + P_F = I$, rule 1) implies
$(E,B)_w + (F,B)_w = (I,B)_w$, and then rule 3) implies $(I,B)_w = B_w$. Thus we expect $(E,B)_w + (F,B)_w = B_w$, which
holds if we substitute the values above. Similarly $(E,C)_w + (F,C)_w = 1/2 + 1/2 = C_w$, and so on. As for rule 2), we
have seen (Section II) that strong measurement of $P_C$ and $P_E$ yields 1, so we expect the weak values to be the same,
as is the case.
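
These values can be reproduced with a few lines of matrix arithmetic. The sketch below is illustrative and rests on assumptions: each beam-splitter is written as a 2x2 matrix with the sign convention $U_2|C\rangle = (|E\rangle - |F\rangle)/\sqrt{2}$, $U_3|E\rangle = (-|D\rangle + |D'\rangle)/\sqrt{2}$, the unused second input port of the first beam-splitter (called A' here) is completed arbitrarily to make the matrix unitary, and the helper names are hypothetical.

```python
import math

s = 1 / math.sqrt(2)

# Beam-splitter matrices (columns: input paths, rows: output paths).
U1 = [[s, s], [s, -s]]     # in (A, A') -> out (B, C); the A' column is an assumed completion
U2 = [[s, s], [s, -s]]     # in (B, C)  -> out (E, F)
U3 = [[-s, s], [s, s]]     # in (E, F)  -> out (D, D')

I2 = [[1, 0], [0, 1]]
P_UPPER = [[1, 0], [0, 0]]   # projector onto B (first stage) or E (second stage)
P_LOWER = [[0, 0], [0, 1]]   # projector onto C (first stage) or F (second stage)

def matmul(X, Y):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def amplitude(second, first):
    """<D| U3 second U2 first U1 |A>: entry (row D, column A) of the chain."""
    return matmul(U3, matmul(second, matmul(U2, matmul(first, U1))))[0][0]

def weak(second, first):
    """Sequential weak value (second, first)_w; pass I2 for 'no measurement'."""
    return amplitude(second, first) / amplitude(I2, I2)

B_w, C_w = weak(I2, P_UPPER), weak(I2, P_LOWER)
E_w, F_w = weak(P_UPPER, I2), weak(P_LOWER, I2)
EB, FB = weak(P_UPPER, P_UPPER), weak(P_LOWER, P_UPPER)
EC, FC = weak(P_UPPER, P_LOWER), weak(P_LOWER, P_LOWER)

N = 1000            # illustrative number of post-selected runs
N_BF = N * FB       # the negative "number of photons" on the path B then F
```

Passing the identity in one slot gives the single weak values, which is rule 3) in action; the sums `EB + FB` and `EC + FC` then reproduce `B_w` and `C_w` as required by rule 1).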

There is a further rule that applies when one of the operators being measured is a projector. We illustrate it with
the double interferometer. We can write

$$\frac{(E,C)_w}{(F,C)_w} = \frac{\langle D|U_3P_EU_2|C\rangle\langle C|U_1|A\rangle}{\langle D|U_3P_FU_2|C\rangle\langle C|U_1|A\rangle} = \frac{\langle D|U_3P_EU_2|C\rangle}{\langle D|U_3P_FU_2|C\rangle} = \frac{E_w}{F_w}. \tag{22}$$

Here $E_w$ and $F_w$ in the final ratio are calculated assuming that $|\psi_i\rangle = |C\rangle$, in other words, as though we were
calculating weak values for the second interferometer treated separately from the rest of the system, with initial state
$|C\rangle$ and post-selection by $|D\rangle$ (Figure 3). If we only knew the single-measurement weak values $E_w$, $F_w$ and $C_w$, we
could calculate $(E,C)_w$ and $(F,C)_w$ using this rule and the relationship $(E,C)_w + (F,C)_w = C_w$ derived above.

FIG. 3: The double interferometer restricted to its second interferometer. According to (22), the ratio of the weak values
$E_w/F_w$ in the second interferometer, with photons injected along C, is the same as the ratio of the sequential weak values
$(E,C)_w/(F,C)_w$ in the double interferometer with photons injected along A.

VI. THE MEANING OF WEAK VALUES 

Consider some experiment in which we inject some kind of particle and weakly measure the projector onto some
location X. Suppose we collect some large number $N$ of runs of the experiment that satisfy the post-selection criterion.
We interpret the fact that the projector at X has weak value $X_w$ to mean that, for any appropriate physical property
we test, due for instance to the charge, gravitational field, etc. of the particle, it is as though $NX_w$ particles (up to
a binomial distribution error) passed along X. Thus in the double interferometer experiment we expect all physical
tests to give outcomes appropriate to there being, in all $N$ runs of the experiment, a total of $N_E = NE_w = N$ photons
passing along E, $N_{CE} = N/2$ photons passing along C then E, and so on.

Can we justify the foregoing interpretation of weak values? For weak measurements of a single operator, there
is a body of work showing that weak values, even when they lie in an unexpected range, can be treated as though
they were the actual values in the underlying physical theory and will then yield correct predictions. Examples of
this include weakly measured negative kinetic energies when a particle is in a classically forbidden region [9], and
weakly measured faster-than-light velocities that are associated with Cerenkov radiation [10]. If a measure is entirely
consistent with physics in this fashion, then we are entitled to say that it is telling us a true physical fact. For
sequential weak values, we can make a similar argument. The physical meaning of sequential weak values needs to
be explored in many physical situations to give the kind of justification that single weak values enjoy. However, the
internal consistency is already clear from the double interferometer example, and, more generally, from the rules in
Section V.

VII. BROADENING THE CONCEPT: WEAK INTERACTIONS 

So far, we have considered ideal weak measurements, in which the pointer distribution is real and has zero mean
(Assumption A). If we drop these assumptions, we find in place of (2) that

$$\langle q\rangle = \mu + g\big(\mathrm{Re}[A_w] + \mathrm{Im}[A_w]\,y\big), \tag{23}$$

where $y = \int \bar\phi\,(pq + qp)\,\phi\,dq - 2\mu\nu$, with $\mu = \int \bar\phi\,q\,\phi\,dq$, $\nu = \int \bar\phi\,p\,\phi\,dq$.

The expectation $\langle r_1r_2\ldots r_n\rangle$ for a general initial pointer distribution, where each $r_i$ is either $q_i$ or $p_i$, is a very
complicated expression, but, so far as the system goes, depends only on the real and imaginary parts of sequential weak
values up to $(A_n,\ldots,A_1)_w$. Thus we can write

$$\langle r_1r_2\ldots r_n\rangle = \Phi\big(\mathrm{Re}(A_n,\ldots,A_1)_w, \mathrm{Im}(A_n,\ldots,A_1)_w, \ldots, \mathrm{Re}(A_n)_w, \mathrm{Im}(A_n)_w, \ldots, \mathrm{Re}(A_1)_w, \mathrm{Im}(A_1)_w\big), \tag{24}$$

for some polynomial function $\Phi$. The coefficients in $\Phi$ are themselves polynomials in expectations $\int \bar\phi\,\eta(p_i,q_i)\,\phi\,dq$ for
polynomials $\eta$, as we see in the case of equation (23), where $y$ has this form.

In the next section, we shall want to consider the most general possible type of weak interaction which allows any
sort of (suitably weak) coupling between the system and an ancilla followed by any further evolution or measurement
of the ancilla alone (the pointer in our previous discussion and its von Neumann measurement interaction $gpA$ will
be a special case of such an ancilla and weak interaction). Our notion of general weak interaction is the following:
Consider the system and ancilla initially in product state $|\psi_i\rangle|\xi\rangle$. Let $H_{S,anc}$ be any Hamiltonian of the joint system,
and $g$ a coupling constant. For a single interaction event, and to first order in $g$, the state becomes

$$(I - igH_{S,anc})|\psi_i\rangle|\xi\rangle. \tag{25}$$

Any joint Hamiltonian may be expressed as a sum of products of individual Hamiltonians

$$H_{S,anc} = \sum_k H_S^k \otimes H_{anc}^k. \tag{26}$$

Post-selecting the system state in equation (25) with $|\psi_f\rangle$ gives

$$\Psi_{anc} = \langle\psi_f|\psi_i\rangle\Big[I_{anc} - ig\sum_k (H_S^k)_w H_{anc}^k\Big]|\xi\rangle. \tag{27}$$

So the system Hamiltonians $H_S^k$ have been effectively replaced by their weak values $(H_S^k)_w$. The important point here
is that all subsequent manipulations of the ancilla will depend on the pre- and post-selected system only through weak
values of suitably chosen observables. A similar result clearly holds for any sequential weak interactions and suitably
associated sequential weak values, and also for terms of any higher order in $g$.

As a simple illustrative example, suppose that the ancilla is the pointer system of a von Neumann measurement
interaction with Assumption A in force, and that this same pointer is weakly coupled twice for the sequential
measurement of both $A_1$ and $A_2$. If this pointer has position $q$ and momentum $p$, the pointer state after post-selection
is

$$\Psi_M = \langle\psi_f|\big(U_3e^{-igpA_2}U_2e^{-igpA_1}U_1\big)|\psi_i\rangle\,\phi(q), \tag{28}$$

yielding

$$\langle q\rangle = g\,\mathrm{Re}\big[(A_1)_w + (A_2)_w\big].$$

The effect in this instance is therefore the same as adding the individual post-measurement results, and it depends
on the system only through associated weak values.



VIII. COUNTERFACTUALITY AND WEAK MEASUREMENT 

Counterfactual computation [11], [13] provides a general framework for looking at counterfactual phenomena,
including interaction-free measurement as a special case. We consider arbitrary protocols, at various points of which a
quantum computer can be inserted. The computer has a switch qubit (with $|0\rangle$ = off and $|1\rangle$ = on) and an output qubit.
A special case of this formalism is where the protocol is represented by an optical circuit, and a computer insertion
means that the computer (or a copy of it) is placed in some path of the circuit and is switched on by a photon passing
along that path.

We assume that the computer is programmed ready to perform a computational task with answer 0 or 1 which
will be written into the output qubit if the switch is turned on. In addition to the switch and output qubits, the
protocol will in general have additional qubits, and will involve some measurements. We say that an outcome of these
measurements determines the computer output if that outcome only occurs when the computer output has a specific
value, $|0\rangle$ or $|1\rangle$. Such an outcome is said to be counterfactual if its occurrence also implies that the computer was
never switched on, i.e. its switch was never set to $|1\rangle$, during the protocol.

To make this precise, note first that one can always produce an equivalent protocol in which the state is entangled
with extra qubits and the measurement deferred to the end of the protocol. Thus the protocol can be assumed to
consist of a period of unitary evolution followed by a measurement, which can be assumed (again by adding extra
qubits) to be a projective measurement. Let $|\psi_i\rangle$ be the initial state of the protocol, and let $|\psi_f\rangle$ be a measurement
outcome that determines some specific computer output, in the sense defined above. Suppose the computer is inserted
$n$ times. Let $\mathcal{F}$ (for "oFf") denote the projection $|0\rangle\langle 0|$ onto the off value of the computer switch and $\mathcal{N}$ (for "oN")
denote the complementary projector $|1\rangle\langle 1|$, and let $h$ be one of the $2^n$ possible strings of $\mathcal{F}$'s or $\mathcal{N}$'s of length $n$; we
call this a history. Let $U_i$ denote the unitary evolution in the protocol between the $(i-1)$th and $i$th insertions of the
computer.

Definition VIII.1 (Counterfactuality by histories). The measurement outcome $|\psi_f\rangle$ is a counterfactual outcome if

1) $|\psi_f\rangle$ determines the computer output.

2) The amplitude of any history $h$ containing an $\mathcal{N}$ vanishes. In other words, for all histories $h = (h_1,\ldots,h_n)$ other than the all-$\mathcal{F}$
history, $\langle\psi_f|U_{n+1}h_nU_n\cdots h_2U_2h_1U_1|\psi_i\rangle = 0$.



FIG. 4: The double interferometer of Figure 1 treated as a protocol with computer insertions (black rectangles) in paths B
and F. If a photon passes down either of these paths, the computer runs.



One may question whether this is the "correct" definition of a notion of counterfactual computation or whether
alternative definitions might be convincingly plausible. Condition 1) is uncontroversial but condition 2) might seem
less immediately compelling. It is evidently equivalent to obtaining a null result if we carry out a strong non-demolition
measurement of $\mathcal{N}$ at each computer insertion. However the disturbance that such a measurement causes might lead
one to question the suitability of this condition. Indeed recently Hosten et al. proposed an alternative definition
of counterfactual computation that violates condition 2) of Definition VIII.1 and sparked a controversy [14] over
the relative merits and validity of the two notions. We will now develop some alternative characterisations of our
Definition VIII.1 in terms of weak measurements, thereby addressing the disturbance issue. We will argue that these
new characterisations considerably strengthen the credibility of the original definition as the "correct" one.

Let us therefore consider carrying out a weak measurement of $\mathcal{N}$ at each insertion. A non-zero weak value implies
that there is a detectable physical effect that can only occur if the computer is switched on. Vaidman's treatment of
the three-box paradox [15] gives a good example of this reasoning.

Our two-interferometer example shows that it does not suffice to consider the individual weak values at each
insertion. For suppose the computer is inserted in paths B and F, as shown in Figure 4. Then we have seen that
the weak values $B_w$ and $F_w$ are zero, yet the sequential weak value $(F,B)_w$ is non-zero. The non-vanishing of the
sequential weak value implies that a photon passes along both path B and F, since there is a physical effect that
causes correlated deflections of pointers at both sites.

There is a subtlety here, because it could be argued that, because sequential pairwise weak measurements give
second-order effects in $g$ (see (15)), we might detect a departure from zero in the weak measurements for each
operator individually, i.e. in the deflections of the pointers at B and F, if we looked at second or higher order terms
in $g$. However, if $A$ is any projector and $A_w = 0$, then the von Neumann interaction $e^{-igpA}$ reduces to $Ae^{-igp} + I - A$,
which is the identity to all orders in $g$ in the weak measurement calculation. Thus we truly need to carry out the
sequential weak measurement here to identify the physical effect due to the photon.
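
The projector identity used here is easy to check numerically. The sketch below is illustrative only: it treats the pointer momentum as a number (one momentum eigenvalue at a time), uses a hypothetical rank-one projector on a two-level system, absorbs the unitary evolution into the pre- and post-selected states, and evaluates the matrix exponential by a truncated Taylor series.

```python
import cmath
import math

def matmul(X, Y):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def expm(M, terms=40):
    """exp(M) for a 2x2 matrix via its Taylor series (fine for small norms)."""
    result = [[1, 0], [0, 1]]
    term = [[1, 0], [0, 1]]
    for k in range(1, terms):
        term = [[entry / k for entry in row] for row in matmul(term, M)]
        result = [[result[i][j] + term[i][j] for j in range(2)] for i in range(2)]
    return result

A = [[0.5, 0.5], [0.5, 0.5]]        # projector onto (|0> + |1>)/sqrt(2)
x = 0.1 * 1.7                       # g * p for assumed values g = 0.1, p = 1.7

lhs = expm([[-1j * x * A[i][j] for j in range(2)] for i in range(2)])
phase = cmath.exp(-1j * x)
rhs = [[A[i][j] * phase + (1 if i == j else 0) - A[i][j] for j in range(2)]
       for i in range(2)]
max_diff = max(abs(lhs[i][j] - rhs[i][j]) for i in range(2) for j in range(2))

# Pre- and post-selection chosen so that <psi_f|A|psi_i> = 0, i.e. A_w = 0.
s = 1 / math.sqrt(2)
psi_i = [1, 0]
psi_f = [s, -s]
overlap = sum(psi_f[r] * psi_i[r] for r in range(2))
coupled = sum(psi_f[r] * lhs[r][c] * psi_i[c] for r in range(2) for c in range(2))
```

With $A_w = 0$ the post-selected amplitude `coupled` equals the bare overlap for any value of `x`, so the pointer is left untouched at every order in $g$.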

In general, we need to consider all possible sequential weak measurements to obtain an adequate test of counter- 
factuality. This is why we must use weak rather than strong measurements. As we have seen in Section lIVl there is 
no strong measurement corresponding to sequential weak measurements. 

We therefore propose the following: 

Definition VIII.2 (Counterfactuality by weak values). The measurement outcome $|\psi_f\rangle$ is a counterfactual outcome if 

1) $|\psi_f\rangle$ determines the computer output. 

2) $(\mathcal{N}_{i_k}, \mathcal{N}_{i_{k-1}}, \ldots, \mathcal{N}_{i_1})_w = 0$ for any $1 \le i_1 < i_2 < \ldots < i_k \le n$, where n is the number of insertions of the 
computer. 

Conditions 2) of Definitions VIII.1 and VIII.2 are equivalent, using the fact that $\mathcal{F} + \mathcal{N} = 1$ together with the linearity 
and marginal rules. For instance, with two insertions of the computer, condition 2) of Definition VIII.1 amounts to 
$(\mathcal{N}_1,\mathcal{N}_2)_w = 0$, $(\mathcal{F}_1,\mathcal{N}_2)_w = 0$ and $(\mathcal{N}_1,\mathcal{F}_2)_w = 0$, and these imply $(\mathcal{N}_1)_w = 0$, $(\mathcal{N}_2)_w = 0$ and $(\mathcal{N}_1,\mathcal{N}_2)_w = 0$, which 
constitute condition 2) for Definition VIII.2. 
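
The implication in the two-insertion example can be written out as a short worked step. This is a sketch of the reasoning rather than a quotation from the paper; it uses only $\mathcal{F} + \mathcal{N} = 1$ at each insertion, the linearity of sequential weak values in each slot, and the marginal rule that inserting the identity at a site removes that site from the weak value.

```latex
% Insert F + N = 1 at the site that is not being measured, then expand by linearity.
(\mathcal{N}_1)_w = (\mathcal{N}_1,\ \mathcal{N}_2 + \mathcal{F}_2)_w
                  = (\mathcal{N}_1,\mathcal{N}_2)_w + (\mathcal{N}_1,\mathcal{F}_2)_w = 0, \\
(\mathcal{N}_2)_w = (\mathcal{N}_1 + \mathcal{F}_1,\ \mathcal{N}_2)_w
                  = (\mathcal{N}_1,\mathcal{N}_2)_w + (\mathcal{F}_1,\mathcal{N}_2)_w = 0.
```

The converse direction runs the same way: for example $(\mathcal{F}_1,\mathcal{N}_2)_w = (\mathcal{N}_2)_w - (\mathcal{N}_1,\mathcal{N}_2)_w$, which vanishes when the weak values in Definition VIII.2 vanish.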

We can try to strengthen the requirements for counterfactuality by demanding that a zero response is obtained for 
any conceivable weak interaction, in the sense of the preceding section. In our present application we must further 
restrict the weak interaction to take place only if the switch has the property of being "on", i.e. the interaction 
Hamiltonian must have the form $(\mathcal{N} \otimes I_{\mathrm{anc}}) H_{s,\mathrm{anc}} (\mathcal{N} \otimes I_{\mathrm{anc}})$. We say that such an interaction is a weak interaction 
involving the projector $\mathcal{N}$. Since $\mathcal{N}$ is a one-dimensional projector, this implies that the interaction Hamiltonian 
has the form $\mathcal{N} \otimes H_{\mathrm{anc}}$. In a more general scenario the projector $\mathcal{N}$ for counterfactuality (analogous to the switch 
being "on") may have rank larger than 1, and then the interaction Hamiltonian may have the more general form 
$(\mathcal{N} \otimes I_{\mathrm{anc}}) M_{s,\mathrm{anc}} (\mathcal{N} \otimes I_{\mathrm{anc}})$ for any Hermitian M. For example, the switch may be a photon with both path and 
polarisation properties. Then a weak interaction restricted to its presence on a path would correspond to a 
two-dimensional projector on its polarisation state-space associated to that path. 

Definition VIII.3 (Counterfactuality by general weak interactions). The measurement outcome $|\psi_f\rangle$ is a 
counterfactual outcome if 

1) $|\psi_f\rangle$ determines the computer output. 

2) Any possible weak interaction involving the projections $\mathcal{N}_1, \ldots, \mathcal{N}_n$ yields a null result. 

By a null result, we mean the same result that would be obtained for g = 0. It is not difficult to show that this 
apparently much broader concept is in fact equivalent to Definition VIII.2. In one direction, we know from the last 
section that any expectation depends only on the sequential weak values involving the projectors $\mathcal{N}_i$, so when these 
weak values vanish we obtain a null result. In the other direction, we have only to show that we can choose particular 
weak interactions whose null results will imply the vanishing of all sequential weak values. However, if we first obtain 
a null value of $\langle q_i \rangle$ and $\langle p_i \rangle$ for the standard von Neumann measurement weak interaction for every i, then we know, 
from the single-measurement formulae such as (3), that both real and imaginary parts of all the weak values $(\mathcal{N}_i)_w$ are zero. 
Then by obtaining null values of $\langle q_i q_j \rangle$ and $\langle p_i q_j \rangle$ for all i < j, we infer from the corresponding second-order 
formulae that the real and imaginary parts of all $(\mathcal{N}_j,\mathcal{N}_i)_w$ are zero. 
We continue this way, using the fact that expectations of products of p's and q's with an even number of p's depend 
on the real part of sequential weak values, whereas those with an odd number of p's depend on their imaginary parts 
(see Appendix). 

We have therefore proved: 

Theorem VIII.4. All three definitions, VIII.1, VIII.2 and VIII.3, are equivalent. 

IX. DISCUSSION 

Sequential weak values are a natural generalisation of the weak value of a single measurement operator Q . Resch 
and Steinberg's simultaneous measurement of two operators [1] gives the same result in the special case where these 
operators commute, but it does not address the case where we have a succession of measurements with unitary 
evolution between them. 

One can argue that both single and sequential weak measurements tell us what the physical situation is. In the 
double interferometer, for instance, $C_w = 1$ really means that all the photons go via C, and $(E,C)_w = 1/2$ really 
means that approximately half the photons go via C followed by E. This is of course a matter of interpretation, and 
may be disputed; but at least it seems to be true that weak values can be fitted into the framework of physics without 
contradiction, and give illuminating explanations of many phenomena. 

Our application of weak measurement to counterfactuals does not depend on the foregoing interpretation. The most 
straightforward part of our claim is that, if a weakly coupled measuring device indicates a displacement of pointers 
in some region of an apparatus, then one cannot claim that the state of the system was unaltered in that region; for 
example, in the case of an optical device, such a shift would indicate that a photon was present. The importance of 
sequential weak measurements in this context is illustrated by the double interferometer (Figure 1). If two pointers 
are coupled to the paths B and F in this apparatus, each pointer individually will show no displacement on average 
after many runs of the experiment. However, the product of the positions of the pointers will show a shift. Thus the 
photon reveals its presence only when information from both pointers is suitably combined. 

The other part of our claim about counterfactuals can be summed up by what we might call the principle of weak 
detectability: 

An event that cannot be detected by any possible weak interaction does not take place. 

This means that we learn a fact X about an event counterfactually from a certain experiment if (1) the outcome 
of the experiment implies X, and (2) no possible weak interaction can detect the occurrence of this event during the 
experiment. It seems as though part (2) might be hard to confirm, because there is a great variety of possible weak 
interactions. However, this condition proves to be equivalent to the vanishing of all sequential weak values associated 
to the event in question, and this will often be much easier to check. 

Finally, we mention the striking fact that sequential weak values are formally closely related to amplitudes. Consider 
the case where we measure n projectors $P_{x_1}, \ldots, P_{x_n}$ that define a path between the initial and post-selected states 
$|\psi_i\rangle$ and $|\psi_f\rangle$, respectively. We can write 

$$(P_{x_n}, \ldots, P_{x_1})_w = \frac{\langle\psi_f|U_{n+1}|x_n\rangle \langle x_n|U_n|x_{n-1}\rangle \ldots \langle x_1|U_1|\psi_i\rangle}{\langle\psi_f|U_{n+1}U_n \ldots U_1|\psi_i\rangle} = \frac{\mathrm{Amplitude}(\pi_x)}{\sum_y \mathrm{Amplitude}(\pi_y)},$$

where $\pi_y$ runs over all paths between $|\psi_i\rangle$ and $|\psi_f\rangle$. Nonetheless, weak values are like measurement results rather 
than amplitudes! This way of looking at sequential weak values suggests a close connection with path integrals that 
remains to be explored. 
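
The relation between sequential weak values and path amplitudes can be illustrated with a small numerical sketch. Everything specific here is an assumption made for illustration: a two-level system, two intermediate measurement times, the particular unitaries U1, U2, U3, and the states psi_i and psi_f. The code computes the sequential weak value of two basis projectors directly from the operator expression, and compares it with the amplitude of the corresponding path divided by the sum of the amplitudes of all paths.

```python
import cmath
import math
from itertools import product

def unitary(theta, phi):
    """A 2x2 unitary: rotation by theta followed by a relative phase phi."""
    c, s = math.cos(theta), math.sin(theta)
    ph = cmath.exp(1j * phi)
    return [[c, -s], [ph * s, ph * c]]

def matvec(X, v):
    return [sum(X[i][k] * v[k] for k in range(len(v))) for i in range(len(X))]

def inner(u, v):
    """The inner product <u|v>."""
    return sum(a.conjugate() * b for a, b in zip(u, v))

def project(x, v):
    """Apply the projector |x><x| onto basis state x."""
    return [v[k] if k == x else 0j for k in range(len(v))]

# Illustrative (assumed) evolution operators and boundary states.
U1, U2, U3 = unitary(0.3, 0.0), unitary(0.7, 0.5), unitary(1.1, 0.2)
psi_i = [1 + 0j, 0j]
psi_f = [complex(1 / math.sqrt(2)), complex(1 / math.sqrt(2))]

def evolve(path=None):
    """<psi_f| U3 P_x2 U2 P_x1 U1 |psi_i>; the projectors are omitted if path is None."""
    v = matvec(U1, psi_i)
    if path is not None:
        v = project(path[0], v)
    v = matvec(U2, v)
    if path is not None:
        v = project(path[1], v)
    return inner(psi_f, matvec(U3, v))

def sequential_weak_value(path):
    """(P_x2, P_x1)_w for path = (x1, x2)."""
    return evolve(path) / evolve()

def path_amplitude(path):
    """<psi_f|U3|x2> <x2|U2|x1> <x1|U1|psi_i> for path = (x1, x2)."""
    x1, x2 = path
    a1 = sum(U1[x1][k] * psi_i[k] for k in range(2))
    a2 = U2[x2][x1]
    a3 = sum(psi_f[k].conjugate() * U3[k][x2] for k in range(2))
    return a3 * a2 * a1

paths = list(product(range(2), repeat=2))
total = sum(path_amplitude(p) for p in paths)
```

For each path p in `paths`, `sequential_weak_value(p)` agrees with `path_amplitude(p) / total`, and the weak values of all paths sum to 1, as the completeness of the projectors at each time requires.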
APPENDIX A: CALCULATION OF GENERAL CORRELATIONS 

With Assumption A, we show here that the general version of (15) is 

$$\langle q_1 q_2 \ldots q_n \rangle = 2\left(\frac{g}{2}\right)^n \mathrm{Re} \sum_{(\mathbf{i},\mathbf{j})} (A_{i_r}, \ldots, A_{i_1})_w \, \overline{(A_{j_s}, \ldots, A_{j_1})_w}, \tag{A1}$$

where the weak values in this formula are given by (18). In (A1) the sum is over all ordered indices $\mathbf{i} = (i_1, \ldots, i_r)$ 
with $i_p < i_{p+1}$ for $1 \le p \le r-1$, and ordered indices $\mathbf{j} = (j_1, \ldots, j_s)$ that make up the complement of $\mathbf{i}$ in the set of 
integers from 1 to n, i.e. that satisfy $(i_1, \ldots, i_r) \cup (j_1, \ldots, j_s) = (1, 2, \ldots, n)$ and $(i_1, \ldots, i_r) \cap (j_1, \ldots, j_s) = \emptyset$. We include 
the empty set as a possible set of indices. In order not to count indices twice, we require $r \ge s$, and when r = s we 
require $i_1 = 1$. 
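
The index sets summed over in (A1) can be enumerated mechanically. The sketch below is illustrative only (the function name `index_splits` is an assumption, not notation from the paper). It lists every pair (i, j) of complementary ordered index sets for n >= 1, keeping r >= s and, when r = s, only the pair whose first set contains site 1, so that each split is counted once.

```python
from itertools import combinations

def index_splits(n):
    """Return the pairs (i, j) of complementary ordered index sets for n >= 1 sites.

    Each split of {1, ..., n} appears once: the longer set is listed first,
    and ties (r == s) are broken by requiring that i contains site 1.
    """
    sites = tuple(range(1, n + 1))
    splits = []
    for r in range(n, -1, -1):
        s = n - r
        if r < s:
            continue  # the mirror-image split has already been listed
        for i in combinations(sites, r):
            if r == s and i[0] != 1:
                continue  # avoid counting a balanced split twice
            j = tuple(k for k in sites if k not in i)
            splits.append((i, j))
    return splits

for n in (2, 3):
    print(n, index_splits(n))
```

Each returned pair labels one term of the sum, and for n sites there are 2^(n-1) such pairs.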

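For instance, with n = 2, the possible indices are $\mathbf{i} = (1,2)$, $\mathbf{j} = \emptyset$; $\mathbf{i} = (1)$, $\mathbf{j} = (2)$, which yields 

$$\langle q_1 q_2 \rangle = \frac{g^2}{2} \mathrm{Re}\left[ (A_2, A_1)_w + (A_1)_w \overline{(A_2)_w} \right]. \tag{A2}$$

This is just equation (15). For n = 3 we have $\mathbf{i} = (1,2,3)$, $\mathbf{j} = \emptyset$; $\mathbf{i} = (1,2)$, $\mathbf{j} = (3)$; $\mathbf{i} = (1,3)$, $\mathbf{j} = (2)$; $\mathbf{i} = (2,3)$, 
$\mathbf{j} = (1)$, giving (19). Equation (A1) is proved in the same way as (15), the state of the n pointers after post-selection 
being: 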

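$$\Psi_{M_1 \ldots M_n} = \langle\psi_f| \left( U_{n+1} e^{-igP_nA_n} U_n \ldots U_2 e^{-igP_1A_1} U_1 \right) |\psi_i\rangle \, \phi(q_1) \ldots \phi(q_n), \tag{A3}$$

$$= \langle\psi_f| \left\{ U_{n+1} \left( \phi(q_n) - gA_n\phi'(q_n) + \ldots \right) U_n \ldots U_2 \left( \phi(q_1) - gA_1\phi'(q_1) + \ldots \right) U_1 \right\} |\psi_i\rangle,$$

$$= \langle\psi_f|U_{n+1}U_n \ldots U_1|\psi_i\rangle \left[ 1 - g\sum_i \frac{\phi'(q_i)}{\phi(q_i)} (A_i)_w + g^2 \sum_{i<j} \frac{\phi'(q_i)\phi'(q_j)}{\phi(q_i)\phi(q_j)} (A_j, A_i)_w + \ldots \right] \phi(q_1) \ldots \phi(q_n).$$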

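Assumption A implies that only the terms in $q_1 q_2 \ldots q_n$ in $|\Psi_{M_1 \ldots M_n}|^2$ need to be taken into account in calculating 

$$\langle q_1 q_2 \ldots q_n \rangle = \frac{\int q_1 q_2 \ldots q_n \, |\Psi_{M_1 \ldots M_n}|^2 \, dq_1 \ldots dq_n}{\int |\Psi_{M_1 \ldots M_n}|^2 \, dq_1 \ldots dq_n},$$

and this leads to (A1). 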

We can also calculate $\langle p_1 p_2 \ldots p_n \rangle$, the product of the momenta of the pointers. To do this, it is convenient to move 
to the momentum basis, replacing $\phi(q)$ by its Fourier transform $\phi(p)$ and carrying out an expansion in the $p_i$: 

$$\Psi_{M_1 \ldots M_n} = \langle\psi_f| \left( U_{n+1} e^{-igp_nA_n} U_n \ldots U_2 e^{-igp_1A_1} U_1 \right) |\psi_i\rangle \, \phi(p_1) \ldots \phi(p_n), \tag{A4}$$

$$= \langle\psi_f|U_{n+1}U_n \ldots U_1|\psi_i\rangle \left[ 1 - ig\sum_i p_i (A_i)_w + (-ig)^2 \sum_{i<j} p_i p_j (A_j, A_i)_w + \ldots \right] \phi(p_1) \ldots \phi(p_n).$$

Assumption A implies that only the terms in $p_1 p_2 \ldots p_n$ in $|\Psi_{M_1 \ldots M_n}|^2$ need be considered in calculating 

$$\langle p_1 p_2 \ldots p_n \rangle = \frac{\int \overline{\Psi}_{M_1 \ldots M_n} \, p_1 \ldots p_n \, \Psi_{M_1 \ldots M_n} \, dp_1 \ldots dp_n}{\int |\Psi_{M_1 \ldots M_n}|^2 \, dp_1 \ldots dp_n}. \tag{A5}$$

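It is simplest to treat the cases of n even and odd separately. For the even case we have 

$$\langle p_1 p_2 \ldots p_{2m} \rangle = 2(-1)^m (gv)^{2m} \, \mathrm{Re} \sum_{r \ge s} \sum_{\mathbf{i},\mathbf{j}} (-1)^r (A_{i_r}, \ldots, A_{i_1})_w \, \overline{(A_{j_s}, \ldots, A_{j_1})_w}, \tag{A6}$$

and for the odd case: 

$$\langle p_1 p_2 \ldots p_{2m+1} \rangle = 2(-1)^{m+1} (gv)^{2m+1} \, \mathrm{Im} \sum_{r \ge s} \sum_{\mathbf{i},\mathbf{j}} (-1)^r (A_{i_r}, \ldots, A_{i_1})_w \, \overline{(A_{j_s}, \ldots, A_{j_1})_w}, \tag{A7}$$

where $v = \int p^2 \phi^2(p) \, dp$. 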

Mixed products of positions and momenta are treated similarly, and they depend only on the real or 
imaginary parts of the sequential weak values given by (18). For example, to calculate $\langle q_1 p_2 \rangle$ we express the first 
variable in the position basis and the second in the momentum basis: 

$$\Psi_{M_1, M_2} = \langle\psi_f|U_3U_2U_1|\psi_i\rangle \left( \phi(q_1)\phi(p_2) - g(A_1)_w \phi'(q_1)\phi(p_2) - ig(A_2)_w \phi(q_1) p_2 \phi(p_2) + ig^2 (A_2, A_1)_w \phi'(q_1) p_2 \phi(p_2) + \ldots \right),$$

which yields the expression for $\langle q_1 p_2 \rangle$ given in the main text. For these mixed products, since there is a factor of i for each p in the product, we take the imaginary 
part of weak values when there is an odd number of p's present and the real part otherwise. 

Thus all possible expectations of products of position or momentum can be obtained from the sequential weak 
values. 